Reinforcement theory

Results: 290



#Item
11Statistics / Statistical theory / Estimation theory / Dynamic programming / Markov decision process / Stochastic control / Bias of an estimator / Reinforcement learning / Loss function / Fisher information

Bias in Natural Actor-Critic Algorithms Philip S. Thomas Department of Computer Science, University of Massachusetts, Amherst, MAUSA

Add to Reading List

Source URL: psthomas.com

Language: English - Date: 2012-10-01 18:27:53
12Game theory / Behavior / Decision theory / Psychology / Nash equilibrium / Behavioral economics / Norm / Reinforcement / Dynamic inconsistency / Trembling hand perfect equilibrium / Subgame perfect equilibrium / Economic equilibrium

POVERTY AND SELF-CONTROL B. Douglas Bernheim Stanford University and NBER Debraj Ray New York University

Add to Reading List

Source URL: thred.devecon.org

Language: English - Date: 2014-06-28 16:47:12
13Computational complexity theory / Theory of computation / Dynamic programming / Markov decision process / Stochastic control / Analysis of algorithms / Mathematical logic / Reinforcement learning / Time complexity / Algorithm / PP

Verification of Markov Decision Processes using Learning Algorithms? Tom´asˇ Br´azdil1 , Krishnendu Chatterjee2 , Martin Chmel´ık2 , Vojtˇech Forejt3 , Jan Kˇret´ınsk´y2 , Marta Kwiatkowska3 , David Parker4 , a

Add to Reading List

Source URL: www.hieratic.eu

Language: English
14Theoretical computer science / Machine learning / Learning / Statistical classification / Support vector machine / Laughter / K-nearest neighbors algorithm / Algorithm / Computational learning theory / Artificial neural network / NP / Reinforcement learning

PREDICTING WHEN TO LAUGH WITH STRUCTURED CLASSIFICATION Bilal Piot1 , Olivier Pietquin2 , Matthieu Geist1 1 SUPELEC IMS-MaLIS research group and UMIGeorgiaTech - CNRS) 2

Add to Reading List

Source URL: www.metz.supelec.fr

Language: English - Date: 2014-07-15 03:12:51
15Monte Carlo methods / Combinatorial game theory / Monte Carlo tree search / Statistical mechanics / General game playing / Reinforcement learning / Simulation / Thomas Nast / Artificial intelligence

Bandits all the way down: UCB1 as a simulation policy in Monte Carlo Tree Search Edward J. Powley, Daniel Whitehouse, and Peter I. Cowling Department of Computer Science York Centre for Complex Systems Analysis Universit

Add to Reading List

Source URL: eldar.mathstat.uoguelph.ca

Language: English - Date: 2016-07-12 12:05:04
16Game theory / Reinforcement learning / Nash equilibrium / Q-learning / Strategy / Partially observable Markov decision process / Action selection / Best response / Bellman equation / Zero-sum game / Agent-based model / Solution concept

Coordination in Multiagent Reinforcement Learning: A Bayesian Approach Georgios Chalkiadakis Craig Boutilier

Add to Reading List

Source URL: www.intelligence.tuc.gr

Language: English - Date: 2009-03-02 16:24:03
17Algebraic geometry / Field theory / Valuation / Reinforcement learning / Differential topology

RAAM: The Benefits of Robustness in Approximating Aggregated MDPs in Reinforcement Learning Dharmashankar Subramanian IBM T. J. Watson Research Center Yorktown Heights, NY 10598

Add to Reading List

Source URL: marek.petrik.us

Language: English - Date: 2016-07-14 09:59:52
18Operations research / Dynamic programming / Mathematical optimization / Equations / Decision theory / Reinforcement learning / Markov decision process / Bellman equation / Policy / Partially observable Markov decision process

Journal of Artificial Intelligence Research Submitted 3/13; publishedA Survey of Multi-Objective Sequential Decision-Making Diederik M. Roijers

Add to Reading List

Source URL: arxiv.org

Language: English - Date: 2014-02-04 20:03:22
19Machine learning / Computational linguistics / User interface techniques / Multimodal interaction / User interfaces / Reinforcement learning / Apprenticeship learning / Computational learning theory / Speech recognition / Intelligent agent / Dialog system / Dialog manager

Inverse Reinforcement Learning for Interactive Systems∗ [Extended Abstract] Olivier Pietquin SUPELEC - UMIGeorgiaTech-CNRS) 2 rue Edouard BelinMetz - France

Add to Reading List

Source URL: www.ilhaire.eu

Language: English - Date: 2013-10-03 05:33:46
20Markov processes / Markov models / Mathematical optimization / Stochastic control / Dynamic programming / Markov decision process / Beamforming / Reinforcement learning / Optimal control / Markov chain / Q-learning / Control theory

1 On Stochastic Feedback Control for Multi-antenna Beamforming: Formulation and Low-Complexity Algorithms Sun Sun, Min Dong, and Ben Liang

Add to Reading List

Source URL: www.comm.utoronto.ca

Language: English - Date: 2014-05-05 14:44:36
UPDATE